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POISSON CALCULUS FOR SPATIAL NEUTRAL TO 
THE RIGHT PROCESSES 1 

By Lancelot F. James 

Hong Kong University of Science and Technology 

Neutral to the right (NTR) processes were introduced by Doksum 
in 1974 as Bayesian priors on the class of distributions on the real line. 
Since that time there have been numerous applications to models that 
arise in survival analysis subject to possible right censoring. However, 
unlike the Dirichlet process, the larger class of NTR processes has not 
been used in a wider range of more complex statistical applications. 
Here, to circumvent some of these limitations, we describe a natural 
extension of NTR processes to arbitrary Polish spaces, which we call 
spatial neutral to the right processes. Our construction also leads 
to a new rich class of random probability measures, which we call 
NTR species sampling models. We show that this class contains the 
important two parameter extension of the Dirichlet process. We pro- 
vide a posterior analysis, which yields tractable NTR analogues of 
the Blackwell-MacQueen distribution. Our analysis turns out to be 
closely related to the study of regenerative composition structures. A 
new computational scheme, which is an ordered variant of the gen- 
eral Chinese restaurant processes, is developed. This can be used to 
approximate complex posterior quantities. We also discuss some rela- 
tionships to results that appear outside of Bayesian nonparametrics. 

1. Introduction. Doksum [9] considered a nonparametric Bayesian anal- 
ysis based on neutral to the right (NTR) priors. These priors are random 
probability measures defined on the real line, 1Z, that include the popu- 
lar Dirichlet process (see [13] and [16]). Within Bayesian nonparametric 
statistics, the NTR process serves as one of the important classes of mod- 
els. In particular, there have been numerous applications to models that 



Received May 2003; revised March 2005. 

Supported in part by RGC Grant HKUST-6159/02P and DAG 01/02. BM43 of HK- 
SAR. 

AMS 2000 subject classifications. Primary 62G05; secondary 62F15. 
Key words and phrases. Bayesian nonparametrics, inhomogeneous Poisson process, 
Levy processes, neutral to the right processes, regenerative compositions, survival analysis. 



This is an electronic reprint of the original article published by the 

Institute of Mathematical Statistics in The Annals of Statistics, 

2006, Vol. 34, No. 1, 416-440. This reprint differs from the original in pagination 

and typographic detail. 



1 



2 



L. F. JAMES 



arise in survival analysis subject to possible right censoring. On the other 
hand, unlike the Dirichlet process, the larger class of NTR processes has not 
been used in a wider range of statistical applications. That is, for instance, 
there are no general NTR analogues of the important class of (kernel based) 
Dirichlet process mixture models. See, for example, [32] and [23] for further 
background and references on Dirichlet process mixture models. 

A goal of this article is to begin to answer the question of how one can 
possibly use NTR processes in a wider context, as has been the case for 
the Dirichlet process. One of the limitations of NTR processes is that they 
are only defined on the real line. The other limitation, which is perhaps 
more severe, is that as of yet we do not have tractable NTR analogues of the 
Blackwell-MacQueen [3] Polya urn distribution associated with the Dirichlet 
process. The Blackwell-MacQueen distribution is well known to be the ex- 
changeable distribution derived from a Dirichlet process, and its theoretical 
understanding and practical implementation are crucial in complex models. 
To circumvent some of these limitations, we describe a natural extension of 
NTR processes defined on an arbitrary Polish space 5? = 1Z + x 3C , which 
we call spatial NTR processes. Here 1Z + denotes the positive real line and 
JT is an arbitrary Polish space. Our construction also leads to a rich class of 
random probability measures on SC , which we call NTR species sampling. 
We provide a detailed analysis of these models and obtain properties analo- 
gous to the Dirichlet process. In particular, we provide a description of the 
posterior distribution of spatial NTR processes and, more importantly, we 
give a detailed analysis of the NTR analogues of the Blackwell-MacQueen 
distribution. 

Such an analysis parallels, in part, the results of Antoniak [1] (see also [12]) 
and Lo [32] for the Dirichlet process. These works involve characterizations 
based on random partitions of the integers {1, . . . , n} and were derived using 
nontrivial combinatorial arguments. The structure of general NTR processes 
is more complex than that of the Dirichlet process and an approach using di- 
rect combinatorial analysis is considerably more challenging. We circumvent 
such issues by applying the Poisson process partition calculus discussed by 
James [24, 26]. This also paves the way for a straightforward derivation of 
the posterior distribution of spatial NTR processes. Using these results, we 
develop a new computational scheme related to the general Chinese restau- 
rant process (see [37], page 60 and [23]), which now allows one to sample 
from the exchangeable distributions derived from NTR processes. 

It is important to note that although Bayesian applications of NTR pro- 
cesses to complex statistical models have been limited, the use of these 
processes appears often in other important contexts. Doksum ([9], Theorem 
3.1) showed that one can describe an NTR distribution function F on 1Z + 
via positive Levy processes, Z, on TZ + as 

(1) l-F(t)=S{t) = e- z ®, 
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where S denotes the survival distribution of a random variable T from F. 
The Levy process Z is an increasing independent increment process that sat- 
isfies Z(0) = and lim^oo Z{t) = oo a.s. That is, T\F has survival distribu- 
tion P(T > t\F) = e~ z ^\ Importantly, the representation in (1) shows that 
NTR survival processes essentially coincide with the class of exponential 
functionals of possibly inhomogeneous, nonnegative Levy processes. Such 
objects and more general exponential functionals of Levy processes, such as 
Brownian motion, have been extensively studied by probabilists with appli- 
cations, for instance, to finance. The NTR models also arise in coalescent 
theory, which has applications in genetics and physics, as seen, for example, 
in [36], Proposition 26. See also [2] and [5]. Noting some of these connections, 
Epifani, Lijoi and Priinster [11] applied techniques from those manuscripts 
to obtain expressions for the moments of mean functionals of NTR models 
and, as we also do here, highlighted some of the connections to these areas 
outside of Bayesian nonparametric statistics. The mean functional can be 
described explicitly as 

roo roo roo 

(2) /= / tF{dt) = S(t)dt= e~ z{t) dt. 

Jo Jo Jo 

It is a significant object, which has interesting interpretations in a variety of 
fields. We describe how this process is related to the study of the Blackwell- 
MacQueen analogue derived from NTR processes. Moreover, we discuss how 
our work is closely related to the recent work of Gnedin and Pitman [18] on 

regenerative composition structures. 

2. Spatial neutral to the right processes. Suppose that (T, X) are ran- 
dom elements on the Polish space 5? that have distribution F(ds,dx) for 
(s,x) € r y. Here we would like to extend the definition of an NTR process 
to model F(ds,dx) as a random probability measure such that its marginal 
F(ds, 3£) is an NTR process. While the representation in (1) is quite use- 
ful for calculations, it is not immediately obvious how one can use this 
definition to extend an NTR process to 5? . The known exception is the 
Dirichlet process that can be defined on arbitrary spaces. To do this, we 
first recall that if F is an NTR process on 7Z + , then its cumulative haz- 
ard A(ds) := F(ds)/ S(s-) is a nonnegative Levy process; in other words, 
A is a completely random measure (see [30]). This observation and alter- 
native idea for modeling via cumulative hazards is due to the important 
work of Hjort [21]. Note that Z in (1) is also a completely random measure. 
Moreover, an important aspect of our results relies on the fact that there 
is one-to-one distributional correspondence between a particular Z and A. 
Specifically, if Jj represents a random jump of A taking its values in [0, 1], 
then — log(l — Jj) is the jump of a corresponding Z taking its values in 
1Z + . Hence if we initially model Z and A as completely random measures 
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without a drift component and fixed points of discontinuity, they may both 
be represented as linear functionals of a common Poisson random measure. 
Importantly, one then may give precise meaning to the distributional equiv- 
alences P(T € ds\F) = F(ds) := S(s-)A(ds) := e~ z ^A(ds), where F is an 
NTR process. 

Our construction now proceeds by extending A and Z to completely ran- 
dom measures on J^, using a representation in terms of a Poisson random 
measure. Let N denote a Poisson random measure on some Polish space 
W = [0, 1] with mean intensity 

K[N(du, ds, dx)^] = u(du,ds,dx) := p(du\s)Ao(ds,dx). 

Here p is a Levy density that will determine the conditional distribution of 
the jumps of A and Z. Furthermore, without loss of generality, we assume 
that Jq 1 up{du) = 1 and, hence, up(du) is a probability density function. 
The intensity v is chosen such that A§{ds,dx) := F§{ds,dx) j So(s-) is by 
definition a hazard measure on J?, where Fq represents a prior specification 
for the distribution of F on 5? , and So is the corresponding survival function 
on 1Z + . See [31], A5.3, for formal details of hazard measures on abstract 
spaces. Note that in [28], Proposition 25.28, the hazard measure is also called 
a natural compensator of a random measure defined as St,x ■ We denote the 
Poisson law of N with intensity v as V{dN\v). The Laplace functional for 
N, which plays an important role in our analysis, is defined as 

E[e~ N ^\u] = / e~ N ^F(dN\u)=e-^\ 
Jm 

where for any positive /, N(f) = j w f (x)N(dx) andSf(/) = f w (l — e~^ x >) x 
u{dx), and M denotes the space of boundedly finite measures on W (see [6]). 
A measure, say N, is boundedly finite if for each bounded set A, N(A) < oo. 
See also [28], Chapter 12, for a discussion of Poisson random measures and 
the unicity property of Laplace functionals. 

Now the specifications above imply that A(ds,dx) := Jq 1 uN(du,ds,dx) 
is a completely random hazard measure on 5? with mean E[A(ds,dx)] = 
A()(ds,dx) and there is a corresponding Z(ds,dx) := f [— log(l — u)] x 
N(du,ds,dx). In particular, — log S(t— ) := Z(t—) = J^[—I{s < t}log(l — 
u)]N(du,ds,dx). Now using these facts we define a spatial neutral to the 
right (SPNTR) random probability measure on 5? as 

(3) F(Tedt,Xedx\F) := F(dt,dx) = S{t-)A(dt,dx). 

Defining A(ds) := A(ds, JT), it follows that F(ds) := S(s-)A(ds) is an NTR 
process and, furthermore, K[F(dt,dx)] = So(t— )Ao(dt, dx) = Fo(dt,dx). See 
Section 5 for more details. 
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Remark 1. The choice of 

(4) p(du\s)Ao(ds, dx) = c(s)u~ 1 (l — u) c ^ _1 duAo(ds, dx) 

for c(s) a positive function yields a natural extension of Hjort's [21] beta 
cumulative hazard process to beta processes on 5? . Equivalently, this speci- 
fication defines beta-Stacy or beta-neutral distribution functions on 5? . See 
[33, 41] and [21], Section 7A, for such processes defined on TZ + . The case 
of the Dirichlet process with shape parameter 6Fq is obtained by choosing 
c(s) = OSo(s-). Our construction of spatial NTR processes is influenced by 
the work of James and Kwon [27] , who first gave an explicit construction of 
spatial beta-neutral processes on 5? via ratios of two independent gamma 
processes. 

Remark 2. Given the specifications in (3), we extend this definition to 
include prior fixed points of discontinuity {(si,wi), . . . , (sk, w^)} in 5? as 



(5) F k {ds,dx) 



{!■■■ si<s} 



A.k(ds, dx), 



where A-k(ds, dx) = A(ds, dx) + Ya=i Ui5 SuWl (ds, dx) is defined such that in- 
dependent of A, Uj are independent random variables on [0, 1] with distri- 
bution Hj for j = 1, . . . , k. We call Fk a general spatial NTR process. 

Remark 3. The log mapping that we use can be deduced, for instance, 
from [7] and [8], Proposition 2. This type of correspondence is actually noted, 
albeit less explicitly, in [21] and is also used in related contexts without 
specific mention of NTR processes; see, for instance, [36], Proposition 26. In 
particular, if r is a Levy measure that specifies the conditional distribution of 
the jumps of Z, then by writing r(dy\s) := r(y\s) dy and p(du\s) := p(u\s) du, 
the relationship between the Levy measures of Z and A is described by 

r(y\s) = e~ y p(l — e~ y \s) for y G 1Z + or 

p(u\s) = (1 - u)~ 1 r(-log(l - u)\s) for u € [0, 1]. 

Note that if p(du\s) := p(du), then we say that the relevant processes are 
homogeneous. 



3. Posterior analysis. Similar to the case of the Dirichlet process, we 
consider the following setup. Suppose that (Ti,Xi)\F are i.i.d. pairs with 
common distribution F for i = 1, . . . , n and suppose the law of F is modeled 
as a spatial NTR process. This description yields a joint distribution of 
(T,X) = {(Ti,X\), . . . , (T n ,X n )} and F. We are interested in the Bayesian 
disintegration of this joint distribution in terms of the posterior distribution 
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of F\T, X and the marginal distribution of (T,X). Since A, Z and F are all 
functionals of TV, we work instead with the joint distribution of (T,X, TV), 



\s(Ti-)A(dTi,dXi) 



F(dN\v) 



.i=i 
(6) 

= Tr(dN\T,X)^(dT 1 ,dX 1 ,...,dT n ,dX n ), 
where 7r(dZV|T,X) denotes the desired posterior distribution of iV|T,X and 
Jt(dT, dX) = Jt{dT x ,dX 1 ,...,dT n , dX n ) 

(7) 



M 



jF(dTi,dXi 



i=l 



V{dN\u) 



is the important exchangeable marginal distribution of (T,X). The j$ de- 
notes the general analogue of the Blackwell-MacQueen Polya urn, and hence 
is crucial to both theoretical understanding and practical implementation of 
the general class of spatial NTR processes. We will describe the posterior 
distribution given (T,X) in Section 4, and give a detailed analysis of ^# 
and related quantities in Section 5. We first explain some key elements of 
the analysis. 

3.1. The role of random partitions and order statistics. It is clear that 
one can always represent (T,X) as (T*,X*,p), where, using notation sim- 
ilar to Lo [32], (T*,X*) = {(Tf.Xf), . . . , (r* (p) ,^ (p) )} denotes the dis- 
tinct pairs of observations within the sample and where p = {E±, . . . , E n ^} 
stands for a partition of {1, . . . ,n} of size n(p) < n that records which ob- 
servations within the sample are equal. The number of elements in the jth 
cell, Ej := {i:(Ti,Xi) = (T*,X*)}, of the partition is indicated by for 

j = 1, . . . ,n(p), so that J2j=i e j = n - ^ follows that the marginal distribu- 
tion of (T,X), say can be expressed in terms of a conditional distribu- 
tion of T, X|p, which is the same as a conditional distribution of the unique 
values T*,X*|p, and the marginal distribution of p. The marginal distribu- 
tion of p, denoted as 7r(p) or p(e\, . . . , e n / p -\), is an exchangeable partition 
probability function (EPPF), that is, a probability distribution on p which is 
exchangeable in its arguments and depends only on the size of each cell. The 
best known case of an EPPF is the variant of the Ewens sampling formula 
(ESF) (see [1, 12]) associated with the Dirichlet process with total mass 9, 
given as 

0„(p )rw »M 
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Additionally, since a Dirichlet process is a special case of what are called 
species sampling models, the distribution of T,X|p is such that the unique 
pairs (Tj,Xj) are i.i.d. with distributions Fq. We note that the marginal 
distribution and, naturally, the posterior distribution of the Dirichlet pro- 
cess depend only on the counts ej and the unique values. The structure of 
for general NTR processes is considerably more complex. However, as we 
will explain, what is interesting is that they do have a natural interpretation 
in terms of classical survival models. One can think of T* as the collection 
of the unordered distinct times to death of individuals in a sample of size n. 
In this sense, the count ej represents the number of deaths at time T** . Ad- 
ditionally, it is well known that the posterior distribution of NTR processes 
also depends on the number at risk at a given time, say t, which can be 
defined as Y n (t) = J2i=iI{Ti > t}. We have discovered that to simplify the 
expressions for it is necessary not only to know the number of deaths, 
but also to know the number at risk at the unique times. Instead of working 
directly with T*, we do this by using its ordered values. 

That is, let Tn . n ) > Tr 2 : n) > ' " ' > ^fn(p) : n) denote an ordering of the 
unique values {T^*, . . . , T*, A. This collection represents the ordered unique 
times of death. Note that we work with the pairs (Tq . n ) , Xj ) , where X? is 
simply the unique value treated as the concomitant of T(j. n y That is, we 
do not order the X values; in fact, some spaces S£ do not have a natural 
ordering. Associated with this, let m = {-E(i), • ■ • , -EV n ( p ))} denote the collec- 
tion of sets Eq\ = {i:Ti = T( J:n )} for j = l,...,n(p). That is, E/j\ is the 
collection of values equal to the jth largest unique death time. Similar to ej , 
let mj = \E(j) \ denote the number of deaths at the jth largest unique death 
time, T(j. n \, for j = 1, . . . , n(p). There are of course n(p)! possible orderings 
of T*. This implies that given a partition p = {E±, . . . ,E n /p^}, the collec- 
tion {mi, . . . ,m n ( p )} [resp. (m)] takes its values over the symmetric group, 
say 5 n ( p ), of all n(p)! permutations of {ei, . . . ,e n(p) } [of ({E 1 , . . . ,E n{p) })}. 
Notice now that, for each s, 

n n(p) n(p) 

Y n (s) = ^/{T, >s}=Y, e jHT* > 4 = E ^HTd-.n) > s}. 
1=1 j=l 1=1 

Hence for j = 1, . . . , n(p), we can define rj_i := Y n (T(j : n )) = Sf=i m li which 
denotes the number larger than the jth largest unique value. Note that 
ro = and ^ n (p) = n > additionally, rj = rj_i + mj. What is important is 
that, unlike p, the collection {£"(!) , • • • , Er n ( p \\ } completely determines (rj) 
via the (mj); that is, m contains the relevant information in p. We will often 
refer to (m, p) rather than m to remind the reader of the dependence of m 
on p. 
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Remark 4. See [34, 37, 38] for a general overview of the EPPF con- 
cept and see [23, 24, 26] for its relevance to general marginal exchangeable 
distributions that arise in a Bayesian context. 

Remark 5. One of the earliest applications of the Ewens sampling for- 
mula is in population genetics. It is quite interesting to note that, as de- 
scribed by Donnelly and Joyce ([10], page 230), one may also interpret the 
(T(j :n \) as the ordering of genetic types (alleles) of individuals, where new 
alleles arise by mutation and the alleles present in the population or in a 
sample at a given time may be ordered by age. An interesting by-product of 
our work is that it actually yields the explicit distribution for large classes 
of such models. A simple description will be given in Proposition 5.2. 

4. The posterior distribution of spatial NTR processes. In this section 
we describe formally the posterior distribution of spatial NTR processes 
given the data (T,X). Note here that we will characterize the posterior via 
the ordered values rather than T*. Since we are conditioning on (T,X), 
these are equivalent notions. We first describe the result for no fixed points 
of discontinuity and then discuss how one easily obtains the extension in 
Section 4.1. The proof is delayed until the Appendix. 

Proposition 4.1. Let F be a spatial NTR process defined by the Pois- 
son random measure N with mean intensity v(du,ds,dx) = p(du\s)Ao(ds,dx); 
A is its corresponding Levy hazard measure. Suppose that (Ti,Xi)\F are i.i.d. 
F for i = 1, ... ,n. Then: 

(i) The posterior distribution o/A|T,X is equivalent to the distribution 
of the random measure N* = N n + J2~j=i $J 3 ■ n ,T (j ■. n) ,x* , where, conditional 
on (T,X), N n is a Poisson random measure with intensity 



Additionally, the (Jj, n ) are conditionally independent of N n and are mutually 
independent with distributions specified by 



for j = l,...,n(p). 

(ii) The posterior distribution of A given (T,X) is equivalent to the law 
of the Levy hazard measure, 



(8) 



v n (du, ds, dx) = (1 — u) 



Y n (s) 



p(du\s)Ao(ds, dx). 



P( J jt n £ du\T (j , n) ) OC U™* (1 - Up- V(d7j|T (j : n) ) 




n(p) 

A n (ds,dx) + J j,nST u . n) ,x*(.ds,dx) 
j'=i 
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where A n (ds,dx) = Jq uN n (du,ds,dx) is a Levy hazard measure with Levy 
measure as in (8) and where the (Jj >n ) are conditionally independent of A n . 

(iii) The posterior distribution of the corresponding Z process is equiva- 
lent to the the law of the random measure 

n(p) 

Z*(ds,dx) = Z n (ds,dx) + ^ Z jin d Tu . n)tX *(ds,dx), 

3=1 

where Z n (ds,dx) = Jq[— log(l — u)]N n (du,ds,dx) and each Zj jU = — log(l — 
Jj,n) with distribution 

nZj, n 6 dy\T {j:n) ) := H*(d(l - e"")) oc (1 - e^)^ e -^-V(dy|T (i:ri) ). 

(iv) Additionally, the posterior distribution of F is equivalent to the con- 
ditional law, given (T,X), of the random probability measure F*(ds,dx) 
expressed as 



e -Zn(s-) 



n(p) 

n (i-^>) 

{j:T {j:n) <s} 



n(p) 

A n (ds,dx) + Pj:nO~T U:n) ,x*(ds,dx), 

3=1 



where Pj j7l = e~ z ™( r (^ ™)~) J 3 >nrij+i( 1 ~~ Ji,n)- It follows that the Bayesian 
prediction rule is given by E\F*{ds,dx)\T,~X\, which can be expressed in 
several ways. 

Remark 6. Note that due to symmetry, one has the equivalence in 
distribution of 

n(p) n(p) 

J j,n5T {j]n) ,x*(ds,dx) = Jj,jT*,x*(ds,dx), 

3=1 3=1 

where the random variables (JJ n ) are mutually independent with marginal 
distributions P(J* n G ds\T*) oc u e > (1 - uf"^ p(du\T*). Recall that 

Y n(T(j:n)) =Tj-X. 

4.1. Remarks on prior fixed points of discontinuity. We have so far omit- 
ted any discussion on the form of the posterior distribution when there are 
prior points of discontinuity as in A^. defined in (5). In fact, the analysis is es- 
sentially already contained in our results. Recall that for n > 1 the posterior 
process for A in the complete data is A* = A„ + J2j=i Jj,n$T (j . n) ,x* , where 

the (Jj, n ) are conditionally independent of A n ,. Using the fact that A& and A* 
are the same structurally, one can simply let n(p) play the role of k and let 
{Ui,si,wi} play the role of { J j>n , Tq . n) , X*}. Let ni = \{i:(Ti,Xi) = (si,wi)}\ 
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for I = 1, . . . , k. In addition, let (T^ . n ) , Xf) denote < n(p) < n unique val- 
ues distinct from {(si,w\), . . . , (sk,Wk)}- Then it is easy to see that the 
posterior distribution of Aj, is of the form 

k n(p) 
1=1 3=1 

where F{Ui ■ „, € du\si} oc u ni (1 - u) Ynif>l) Hi(du) for I = 1, . . . , k. Note here we 
useY n (s) = Y:7 =1 I{Ti>s}. 

Remark 7. Note that marginalizing over 3C , the result in Proposi- 
tion 4.1 reduces to the appropriate analogous results for NTR processes 
described in [9, 14, 15, 21, 29]. However, we shall present a considerably 
streamlined and direct proof that uses a methodology applicable to a much 
wider class of random probability measures on abstract spaces. Note more- 
over that there is no analogue of Proposition 4.1(i) appearing in those works. 
The distribution of F*(oo, dx) corresponds to the posterior distribution of a 
new class of random probability measures, which we discuss in more detail 
in Section 5.3. 



5. Analysis of NTR generalizations of the Blackwell MacQueen distribu- 
tion. We now present a detailed analysis of the marginal distribution ^# 
and related quantities. We give details for Lemmas 5.1 and 5.2 in the Ap- 
pendix. First we introduce some additional notation. For a homogeneous p 
or r and for u> > 0, let 

• i 



<K<») 



(i 



-uy 



)r(dy) 



(l-(l-u)Xdu) 



u>(l — u) 



Ul-1 



p(dv) 



du. 



This is the Levy exponent defined by the Laplace transform of a homogeneous 
Z process. For integers (i, k), let 

poo fl 

^, fc («) = / (1 - e-^)e-"*r(dy| a ) = / (1 - (1 - u) l )(l - u) k p{du\s). 
Jo Jo 

In the homogeneous case, set k = Jo°°(l — e~ yi )e~ yk r{dy) and note that 
for each j, (f>(J) = ipj.o = /o°(l — e~ jy )r(dy). Finally, we define cumulants 

1 u m *(l-u) r *- 1 p(du\s) 



and 



Ms) 



u m i(l-u) r J-ip(du). 
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Our first task will be to obtain a nice expression for the expectation of 
the product of survival functions that appears in (6). First notice that 



(9) 



i=i 



n(p) 



n(p) 

II s ( T (j--ny 



These equivalences lead to the following result. 

Lemma 5.1. Let v(du,ds,dx) = p(du\s)Ao(ds , dx) be the mean intensity 
of a Poisson random measure N . Then 



E 



n(p) 
L 3=1 



n (p) . 

n e/< 

5=1 



: ") 



V , m,r._ 1 (s)A (a!s) 



XTie expression reduces to YYj=i exp(— J ' ijjij-i(s)Ao(ds)) when there 
no ties. 



are 



of 



Lemma 5.1 is instrumental in obtaining the following initial description 



Lemma 5.2. Let j$ denote the exchangeable distribution o/(T,X) de- 
fined in (7). Then -#(dT,(£X) can 6e expressed as 



n (p) . 
i=l 



(j : ") 



V , m j ,r_ 1 (s)Ao(rfs) 



n(p) 

1=1 



We now show how one can obtain calculations using For each m € 
<SVi(p) and integrable function <?(T), define 

/■oo roo poo n (p) r ti , , . . , , . 

L{g;m)= / •••/ g((t,m)) [] ^' r - l(s)A ° (ds) 

X K mj ., rj „ 1 (p|tj)A (d^), 

where t\ > ti > ■ ■ ■ > t n ( p ) denotes one of n(p)! orderings of the unique 
values. With some abuse of notation, the vector (t,m) = (t) denotes the 
collection of n points whose n(p) unique values are ordered according to 
m. For example, suppose one has the function g(Ti,T2,T^). Then in the in- 
stance where T\ = T2 < T3, one has re(p) = 2 unique values and one evaluates 
9{T(2:2),T(2:2),T(i:2)) or 5 using the notation above, g(t 2 ,t 2 ,ti). 

We now use L to obtain very general formulae for expected values of 
complex integrals of NTR processes. This plays a key role in obtaining the 
EPPF 7r(p) and related quantities. 
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Lemma 5.3. Assume that the random functional J^(g) = J g(t) Yli=i F{dti) 
is integrable, where F is an NTR process specified by the Poisson law ~F(dN\v) . 
Then it follows from Lemma 5.2 that 



E L (a;m) 



In the homogeneous case, p(du\s) = p(du), the expression reduces to 

-n(p) 



E 



E 

meS„( p ) 



Ap) 



poo poo poo "(p) 

/ / •■•/ g(t,m)l[, 

Jo J t n{p) Jt 2 J=1 



-Ao(tj)ip. 



j )fm.j,Tj_i 



A (dtj 



Proof. The result follows from an application of Fubini's theorem and 
Lemma 5.2, which yields 



M 



J?{g)¥{dN\v)= / g(t)j?(dt,dx 



□ 



Remark 8. The case where g may depend also on X is obvious. It is 
important to note that Lemma 5.3 may viewed as a generalization of Lo 
([32], Lemma 2). 

We now use Lemma 5.3 to obtain a simpler description of the distribution 
of (T,X), which also yields easily the EPPF formulae and a corresponding 
distribution on (m,p). Note again that we do this without resorting to the 
types of combinatorial arguments used, for instance, in [1] and [32]. 

Proposition 5.1. Let (T,X) denote the random variables with the ex- 
changeable distribution jtft described in Lemma 5.2. Then this distribution 
may be expressed in terms of a conditional distribution o/T,X|m, p and a 
distribution of (m, p) as follows: 

(i) There exists a marginal distribution o/T,X[m, p, given by n(dT, 
dX|m, p) proportional to 



n(p) 

ne-/c 

L j=l 



T : ") , f \ A /J S 

o *ry,»v_i«Ao(«to) 



- n(p) 

-i (p\ T (j : n)) II A 0( dT (j : n),dX*), 
. 1=1 



where Th :n ) > T^-.n) > •■• > Tr n f p y n -\ denotes the order statistics of the 
unique values T*. In the homogeneous case the result reduces to 



7r(dT,dX|m,p) 



SPATIAL NTR PROCESSES 



13 



(10) 



n(p) 

II tin) 



n(p) 

IK 

3=1 



1pm j ,r j _ 1 A (T (j . n) ) 



n(p) 

IJ A (dT [j:n) ,dX*). 
1=1 



In both cases Yl^^i Po(dXj \T(j. n }) is the conditional distribution o/X|T,m. 

(ii) The distribution of (m, p), is described as follows. The EPPF derived 
by i.i.d. sampling from F is expressible as 

tt(p)= e L ^ m )- 



?l(p) 



The representations imply the existence of a joint distribution o/(m, p) given 
by 7r(m, p) =L(l;m). Additionally, in the case where p(du\s) = p{du), the 
formulae reduce to 



(iiMp)= E 



n n(p) 

117=1 " , m,-,rj 



m6S. 



?l(p) 



and 7r(m, p) 



Proof. Statement (i) follows from (ii) and Lemma 5.2. The proof of 
(ii) in the general case follows from Lemma 5.3 with g := 1. In the case of 
p(du\s) = p(du), 7r(p) is equivalent to 



E 

m GS'„(p) 



n(p) 

n 



/ / •••/ T e- Ao{t ^' r ^A {dtj). 
Jo Jt n(p) Jt 2 j=1 



It° Y[ n -i e~ A °( tj ^ m i' r i- 1 x 



The result is concluded by evaluating f °° / f ' 

Ao(dtj). This is done by noting that for any positive C, / f °° e~ CA °^Ao(du) 
C~ 1 e~ CA °( t \ In addition, ip mij o = <p(m{), and for each j, 0(rj_i)+^ mjjrj _ 1 
0(ry). □ 



Equation (10) in Proposition 5.1 can be used to deduce an explicit Markov 
property in the homogeneous case that has the interpretation that the dis- 
tribution of the next death time only depends on the previous death time. 
Moreover, it demonstrates that it is fairly simple to sample from (10). 



Proposition 5.2. Given (m,p) ; let T( 1: „),..., T( n ( p ) :n ) be distributed 
according to (10) . Moreover, set Ao(t) = t. Then, conditional on Tq +1 . n \ , . . . , 
Pn(p) > the distribution of Tu . n \ depends only on Tq^ = tj+\ and is given 
by the truncated exponential distribution with density 

P(T (j : n) € dtj\T u+1 : n) = t j+1 ) = ^e-*^-^ dtj 
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for tj > tj+\. In particular, the smallest value, or equivalently the first of 
n(p) death times, T^ p y ri ^, has a marginal distribution that is exponential 
with parameter 4>{n), that is, 

F(T {n{p) .. n) edy) = ( f>(n)e-^dy. 

5.1. Some connections to exponential functionals and means of NTR pro- 
cesses. We now relate some of our results to those of Epifani, Lijoi and 
Priinster [11] and Carmona, Petit and Yor [5] concerning moment formulae 
for means of NTR processes. Briefly, using the relationship in (2), Epifani, 
Lijoi and Priinster ([11], Proposition 5) established the following moment 
formulae, expressed in our notation, that characterizes the distribution of /: 

/*oo /*oo roc n / rtj roo 

E[I>]=n!/ / ■•■/ Il ex P-/ / (i-e - ") 



(12) 



Jt n{p) J t2 j 



= 1 



Jo 



x e- y{ - j -^T{dy\s)A {ds)) dtj. 



The authors also provide conditions under which the moments exist, 
which amounts to the finiteness of the moment of order n of Fq; that is, 
Jq°° t n Fo(dt)< oo. In addition, when p(du\s) = p(du) and Ao(i) = t, the ex- 
pression in (12) reduces to the interesting formulae of Carmona, Petit and 
Yor ([5], Proposition 3.3), viewed within the context of exponential func- 
tionals of a subordinator, 

n ' 

(13) E[J>]= , 

Uj=inj) 

Notice that the specification Ao(t) = t is equivalent to specifying Fq as an 
exponential(l) distribution. In addition, Carmona, Petit and Yor ([5], Propo- 
sition 3.1) establish the following result for any A > 1 and more general Levy 
processes: 

Lemma 5.3 offers a complementary result to theirs in that one can express 
E[/ n |i/] in terms of sums over partitions p. Apparently, for NTR processes, a 
result of this type is only widely known in the case of the Dirichlet process, 
which follows as a special case of Lo [32]. The result is as follows. 



Corollary 5.1. Let I be defined as in (2). Then setting g(t) = n^=i t 

rrij 
(J) 



n?=i i- n Lemma 5.3, one has I = J?(g) and hence 



E[J>] = £ 



L (f; m ) 



P LmeS„ (p) 
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In particular, in the case where p(du\s) = p(du) and Ao(t) = t, Lemma 5.3 
combined with the result of Carmona, Petit and Yor [5] yields the identity 



E 



E 



meS. 



n(p) 



■n(p) 

n 

■i=i 



-i(p) 



/ / •••/ ] [ '7'' <><:, 
Jo Jt n(p) Jt 2 j=1 



Another relationship to the formula for E[/ n |z/], (13), given in [5], is seen in 
the next corollary, derived from Proposition 5.1, which describes the formula 
for the case where all cells are of the same size. 



Corollary 5.2. Suppose that p(du\s) = pidu) and n = kn(p). Then 
with respect to the EPPF given in (11), the probability of the event p = 
{Ei, . . . , E n f p \}, such that the size of each cell is k, is 



TT{p) 



"(PWS J? «*(!"«) 



(j-i)fc 



p(du) 



As special cases, when n(p) = n, the probability of no ties in the sample 
corresponds to the probability of the event p = {{1}, {2}, . . . , {n}} given by 



7r(p) 



UU4>(j) 



E[i»n 



u(l — uY 1 p(du), 



for E[I n |i/] given in (13). When n(p) = 1, p = {1, 2, . . . , n} corresponds to 
the event that all the values in the sample are the same, and the probability 
is given by 



7T(p) 



Jo 1 u^p(du) _ J™ (1 - e-y) n r(dy) 



/o°°(l 



-ny 



\r(dy) 



Remark 9. The event of no ties, n(p) = n, corresponds to the common 
assumption in the literature for observed data. Analogous to Antoniak [1] 
for the Dirichlet process, it follows that when n(p) = n, using Corollary 5.2, 
the distribution of T,X|p in the homogeneous case is 



Efrvp 1 



II' 

1=1 



-Ao(T( i:n ))^i,i-i 



J j=l 



Remark 10. Gnedin and Pitman [18], independent of this work and 
by different arguments, obtain formulae for what are called regenerative 
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compositions that contain our results in (11). Their formulae are derived 
from a discretization of subordinators. In fact, the authors show that all such 
regenerative compositions are determined uniquely by their construction via 
subordinators. The authors' result is more general, in the homogeneous case, 
because they include the result for subordinators with drift components. It 
is, however, a simple matter to adjust our results to allow for a drift (see [24], 
Remark 28). They do not cover the inhomogeneous cases we consider. We 
discovered these connections through a mutual exchange of manuscripts in 
progress. The authors' description via a decrement function and composition 
structure contain additional binomial coefficients. Explicitly in terms of our 
notation, their composition structure is expressed as 

re! . . 
^ 7r(m,p). 

ii.r; ; <,! 

The authors identify some particularly interesting composition structures 
and we will show how this translates into an interesting class of spatial 
NTR models. See also [10, 17, 35] for relevant references. See also [19, 20] 
for important results related to the rates of various re(p). 

5.2. Sampling modified Chinese restaurant processes. Propositions 5.1, 
5.2 and 4.1 dictate how one might sample (T,X) from ^ . This is especially 
true in the homogeneous case. One proceeds essentially by first obtaining a 
draw of (m, p) from 7r(m, p), then using Proposition 5.1 or 5.2 to draw the 
ordered unique values (T(j . n ) ) from the relevant truncated exponential dis- 
tributions. The X* are then drawn from Po(dXj\T(j. n ^) for j = 1,... ,re(p). 
Additionally one can then (approximately) draw F\T, X, by using the rep- 
resentation F* from Proposition 4.1, which suggests to draw (Jj, n ), and 
then applying methods in the literature to approximate quantities such as 
A n (see, e.g., [4]). These are precisely the type of steps that would lead to 
efficient approximations in more complex mixture models, that is to say, 
models where (T,X) are missing values obtained from and are not di- 
rectly observed. Also, by sampling from ^ one can approximate quantities 
such as those that appear in Lemma 5.3. In this section it is shown how one 
might generate (m, p) from 7r(m, p) in the case where p(du\s) = p(du) via 
a sequential seating scheme with probabilities derived from the prediction 
rule given (m, p). This idea also holds in the nonhomogeneous case. The 
scheme bears similarities to generalized Chinese restaurant processes that 
can be used to generate general EPPFs, 7r(p) =p(e±, . . . ,e n r p \). Using the 
description in [37], page 60, the generalized Chinese restaurant scheme as- 
sumes that an initially empty Chinese restaurant has an unlimited number 

of tables labeled 1,2, Customers numbered 1,2,... arrive one by and 

are seated sequentially according to probabilities derived from ratios of the 
EPPF. Basically customers are seated with probabilities that depend on the 
size or number of customers already seated at the existing tables. 
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5.2.1. Ordered generalized Chinese restaurant processes. In general, to 
draw from p(mi, . . . ,w, n ( p )) := 7r(m, p), we introduce a new scheme, which 
is a modified Chinese restaurant process that also records the rank of the 
entering customers relative to the already seated customers. The first cus- 
tomer is seated and assigned an initial rank of 1. Now, given a configuration 
based on n customers seated at n(p) existing tables labeled with ranks from 
j = 1, . . . , n(p), the next customer n + 1 is seated at an occupied table j, de- 
noting that customer n + 1 is equivalent to the jth largest seated customers, 
with probability 

_p(...,m j + l,...) 

P j : n 



p(mi,...,m n(p) ) 

(14) 



«m J+ l,r- J _ 1 (p) n?=ill Km,,r,_i+l(p) ^ <p{n) 



n 



«m 3 -,r j _ 1 (p)nSl«m I ,r J _ 1 (p) f4 ^ + *) ' 

Customer ra + 1 is seated at a new table with probability 1 — J2j=i Pj-.n- 
However, if customer n + 1 is new, it is also necessary to know the customer's 
rank and as such to rerank by one position all customers smaller than the 
new customer. Hence the probability that customer n + 1 is new and is the 
jth. largest among n(p) + 1 possible ranks is 

_ p(...,mj_i,l,mj,...) 
p(?ni,...,m n(p) ) 

with q n (p)+i; n = K\ t n{p) / 4>{n+ 1). Note that in the calculation of Ki )rj l (p), 
+ 1 is to be used rather than rj = rj_i + rrij. 
As an example, consider the choice of a homogeneous beta process that 
corresponds to c(s) = in (4). Then it is easily seen that 4>(rj) = ^2^0/(0 + 
/ — 1) and it follows that, in this case, 

mj ffi <f>(n) 



n + fj. 0(n + l) 



and 



n(p) 

i=j 



4>{n) 

71 + 9 EliT +1 1/(0 + » - 1) f=j ^ + 
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5.3. Species sampling models generated by spatial NTR processes. The 
availability of the EPPF, coupled with Pitman's [34] theory of species sam- 
pling random probability models, implies that there exists a new explicit 
class of random probability measures of the form 



/•oo 00 

(15) Pf(-)= S(s-)A(ds,-)=J2QiS Zi (-), 

Jo i=i 

where Z^ are i.i.d. random elements in SC with some nonatomic law Pq and 
where, independent of (Zi), (Qi) denotes a collection of random probabilities 
that sum to 1 and whose law is completely determined by the EPPF vr(p) 
given in Proposition 5.1. We will call Pp an NTR species sampling model. We 
do point out that although there are technically a large number of possible 
species sampling models, to date there are only two well-known classes: the 
species sampling models based on the Poisson-Kingman models described 
in [38] (see also [24]) and the stick-breaking models described in [22]. See 
also [40]. 

The NTR species sampling model, which is defined for the first time here, 
represents a third case where, due to the present analysis, much is known. 
All three classes contain the Dirichlet process. In fact, rather remarkably, 
all three classes contain the two-parameter (a, 9) Poisson-Dirichlet family of 
random probability measures with parameters < a < 1 and > 0. We will 
describe this in a forthcoming section. The next proposition describes how 
one can always formally obtain an NTR species sampling model generated 
by an F with an independent prior specification, Fo(ds,dx) = Fo(ds)Po(dx). 
Moreover, we give a description of its posterior distribution. 

Proposition 5.3. Let v(du, ds, dx) = p(du\s)Ao(ds, dx) denote the mean 
intensity of a Poisson random measure N on W , where Aq is chosen such 
that AQ(ds,dx) = Ao(ds)Po(dx). Then the corresponding spatial NTR pro- 
cess, F, generates an NTR species sampling model, Pf, given in (15), by the 
representations Pp(dx) := F(oo,dx) = J °° S(s—)A(ds,dx) or, equivalently, 
the marginal distribution o/X = (X*,p) is given by 



E 



\P F {dXi)\v 



i=l 



n.(p) 



vr(p) J] P (dX*). 



=1 



Additionally, the posterior distribution of Pf given (T,X), or just X, is 
characterized by Proposition 4.1 and 5.1. Specifically, it is equivalent to the 
appropriate conditional laws of the random measure F*(oo,dx). 

PROOF. Under the specifications F (ds, dx) = F (ds)P (dx), ^$C(dT, dX) 
is such that given p, the vectors T* and X* are independent, where X* has 
joint law n?i^ Po{dX*). The result is concluded by integrating out T*. □ 
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It is interesting to note that while the Dirichlet process is an example of 
Pf, it also arises without the independence specification. In most cases 7r(p) 
will not be easy to work with directly; as such one can work with 7r(m,p). 
As an example, we present a description for the prediction rule of Pp given 
(X, m). It will be clear that one can employ the ordered generalized Chinese 
restaurant algorithm in Section 5.2.1 to draw easily from a joint distribution 
of (X, m). 

Proposition 5.4. Let Pf denote an NTR species sampling model de- 
fined by the choice p(du\s) = p(du). Suppose that X = {X\, . . . ,X n } given Pf 
are i.i.d. Pp. Then one can define a prediction rule for X n+ \ given X, m as 

/ «(p) \ «(p) 
P(X n+1 G dx|X,m) = 1 - ) p o(dx) + ]T 

\ j=l J j=l 

where (pj :n ) ar & given in (14). Note also that 7r(m, p) 11^=? Po{dXj) is the 
distribution of (X,m), which means that the distribution of X|m is such 
that the unique values {X*) given m are i.i.d. Pq. The prediction rule given 
X is obtained by P(X n+ i G ete|X) = J2 m es ^{X n +i £ dxlX,m)7r(m|p). 



6. Examples. 



6.1. Generalized gamma models. An interesting class of measures is the 
family of generalized gamma random measures discussed in [4]. Using the 
description of Brix [4], these are Z processes with Levy measure 

Ta,b{dy)A (ds,dx) = - — — — -y-*- 1 exp(-by) dyA (ds, dx), 

a)6 (l)r(l - a) 

where (/> a ,b(l) = ^[(P + l) a - (b) a ]. The values for a and b are restricted to 
satisfy < a < 1 and < b < oo or — oo < a < and < b < oo. Different 
choices for a and b in p a j, yield various subordinators. These include the 
stable subordinator when 6 = 0, the gamma process subordinator when a = 
and the inverse-Gaussian subordinator when a = 1/2 and b > 0. When a < 0, 
this results in a class of gamma compound Poisson processes. Generalized 
gamma NTR processes with b > are discussed in [11]. Here, from our 
results, 



[{rj + br ^ {rj _ l+br] 
i> r i-i [(1 + &)«-&«] 

]) [{l+b) a -b a } 
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and 

( ES)(-l) t+1 (?)(& + r- 3 --i + tt 

Hence 



vr(m,p) 



The process F(ds,dx) is such that marginally F{ds,3£) is a generalized 
gamma NTR process and Pp(dx) = F(oo,dx) is a species sampling model. 
Additionally one can use Proposition 5.2 to generate the Tu. n \. In particu- 
lar, when 6 = and Ao(t) = t, the density corresponding to the stable process 
with index < a < 1 is 

P(T (j:n) e ^|T (i+1:n) = t J+1 ) = r « e -Tte-*i+i] for ^ > t i+1 . 



6.2. T7ie spatial NTR two-parameter Poisson-Dirichlet model. We now 
describe perhaps the most remarkable class of spatial NTR processes. Gnedin 
and Pitman ([18], Section 10) were able to deduce that one can generate 
the EPPF of the two-parameter (a, 6) Poisson-Dirichlet distribution with 
parameters < a < 1 and 9 > by specifying a homogeneous p such that 

m rx*)' j"!^i --(i-)' 



and, hence, 



r(i-a)r(i + 0) 

r j T{e + r j )T{e + 2-a) 
T{9 + l)r(0 - a + r* + 1) ' 



Due to Proposition 5.2, this is enough to generate the distribution of the 
(T(j:n))- Note for this model one can directly sample from the well-known 
EPPF. 



6.2.1. The ordered ESF and the Dirichlet process. An interesting case 
is when a = 0; that is, p{du) = 6(9 + 1)(1 — u) 9 ~ l du. This choice generates 
the ordered Ewens sampling formula as described in [10]. Moreover, the 
spatial NTR process F(ds, dx) is such that F(ds, X) is an NTR process but 
not a Dirichlet process, and it follows from Proposition 5.3 that Pp(dx) = 
F(oo,dx) is a Dirichlet process with shape 9Pq. Hence, this shows that a 
Dirichlet process may be generated via a homogeneous NTR process derived 
from a compound Poisson process. Note of course that when x E TZ, this 
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process is marginally an NTR process in both coordinates. Setting Ao(i) = t, 
the corresponding distribution of the Tfj . n ) is given by 

P(T ( , :n) G ^|T ( , +1:n) =t i+1 ) = ^h e -((^/(^))fe-*i + i] 

1" + r j) 

for tj > tj+i. 

Note also that the distribution of the jumps that depends on (Ty :n )) is 

P(J ilB € du|Ty :n) ) = — ~~TywH~r^ 7U mj (1 - ': 
u ' T(mj + 1)1(6 + rj-i) 

that is, they are beta distributed with parameters (rrij + 1,6 + fj-i)- Note 
that these are not the jumps of a posterior Dirichlet process. However, since 
F(oo, dx) is a Dirichlet process, its posterior distribution given X is a Dirich- 
let process with shape 6Pq + J2?=i $Xi ■ 

6.2.2. Representations for the general two-parameter (a, 6) case. We can 
use the result above to provide some new results related to the two-parameter 
{a, 6) Poisson-Dirichlet family and spatial NTR processes. For clarity, we 
first recall the definition of the two-parameter (a, 6) Poisson-Dirichlet class 
of random probability measures. The two-parameter (a, 6) Poisson-Dirichlet 
random probability measure with parameters < a < 1 and 6 > has the 
known representation 

P a ,9[dx) = —= , 



where fj, a> Q is a finite random measure on 3C with law P(d / Li Cfj £)) and where 
T a ,e — ^u,e{%-) is a random variable. The law of the random measure \i a fi 
can be described as follows. When a = 0, //o,e is a gamma process with shape 
6Pq; hence, Po,6» is a Dirichlet process with shape 6Pq. When 6 = 0, ^ Qj o is a 
stable random measure of index < a < 1. Note that both /U a o and /Uo,0 are 
completely random measures and can be represented in terms of a Poisson 
random measure. However, this is not true for the case where both a and 
6 are positive. Here for < a < 1 and 6 > one has the absolute continuity 
relationship 

t- *p(^ Qj0 ) 



where T Qj o is a stable law random variable. This class of models also has 
a representation in terms of stick-breaking processes. See, for instance, [22, 
34, 39] for further details. We now arrive at the following interesting obser- 
vations. 
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Proposition 6.1. Let F(ds,dx) denote a spatial NTR process specified 
by the choice of p in (16) and let Fo(ds,dx) = Po(dx)Fo(ds). Then Pf is a 
two-parameter (a, 6) Poisson-Dirichlet process. This yields the representa- 
tions 

P F (dx)= / S(s-)A{ds,dx) 
Jo 

oo k— 1 

= E^II( 1 -^)<M^) 

k=l i=l 

= (J, a ,o(dx)/T a> = P at9 (dx), 

where (V&) are independent beta (1 — a,9 + ka) random variables independent 
of the (Zfr), which are i.i.d. Pq; that is, a two-parameter (a, 9) Poisson- 
Dirichlet process can be represented as the marginal probability measure of 
a spatial NTR process as described above. 

Proof. The general result follows from an application of Proposition 5.3 
combined with the calculations of the EPPF using p in (16) by Gnedin and 
Pitman ([18], Section 10). The case of the Dirichlet process that corresponds 
to the choice of p(du) = 9(9 + 1)(1 — u) 9_1 du could be deduced as well from 
[10] in combination with Proposition 5.3. See also [35] for the (a, a) model. 
□ 



APPENDIX 

Proofs of Proposition 4.1 and Lemmas 5.1 and 5.2. We now show that 
the proofs of Proposition 4.1 and Lemmas 5.1 and 5.2 follow as a sim- 
ple consequence of the Poisson partition calculus methods as laid out in 
[24, 26]. First set W t = (Ji,T h Xi) for i = 1, . . . , n, elements of W . The col- 
lection J = { J\, . . . , J n } with values in [0, 1] will play the role of the latent 
jumps. Its unique values are the (Jj>)- Set W = (J,T,X) and let Wj = 
( Jj jH , T(j . n ), X*) denote the j = l,...,n(p) unique triples. Using Proposi- 
tion 2.3 of [26] yields the following statement. Suppose that (W, N) are 
measurable elements in the space W n x M, where N is Poisson random 
measure with sigma finite nonatomic mean measure v. Then for each non- 
negative measurable / such that ^(/) < oo, the following disintegration 
holds: 

n 

Y[N(dWi) 

.i=i 

(17) 

= e*Wv{dN\v f , W) [] e- f{W ^u(dW*), 



e - N (f)-p(dN\v) 
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where F(dN\vf, W) denotes the law of the random measure N + J2j=i $w* > 
where N is a Poisson random measure with mean intensity W[N(du,ds,dx)\vf\ 
Vf(du, ds, dx) := e~^ u,s,x ^v(du, ds, dx). 

To apply the results above we first express (9) in terms of an exponen- 
tial functional of a Poisson random measure as follows. For each j, set 
fT {j . n) _(u,s,x) = I{s < T(j :n )}[— log(l — u)]. Now it follows that one can 
define 

n(p) 

f n (u,S,x) = m jfT (j:n) A u , s , x ) = -V n (s)log(l - U) 
3=1 



and hence one has 



n(p) 

II S ( T U--n)- 
3=1 



n(p) 
3=1 



-N(fn) 



Note also that e~ f ^ u ' s '^ = (1 - u) Yn[s) and e~ MJ '' n > T <* !n ')> x i' ) = (1 - J J >) ri_1 
for j = l,...,n(p). 

The next step is to write A(dTi,dXi) = Jq JiN(dJi,dTi,dXi). Now re- 
moving those integrals in (6) yields an augmentation of the distribution of 
(T, X, N) in terms of a distribution of (J, T, X, N). It follows that the distri- 
bution of (J,T,X, N) can be expressed similar to the left-hand side of (17) 
with f n in place of / as 



n(p) 

n J m 



j 

J." 



3=1 



\N{dJi,dT u dXi 



8=1 



Note that n?=i Ji = 11^=1 Jj*n • Hence now applying the right-hand side 
of (17) one has that the joint distribution of (J,T,X,iV) is given by 

P(dW|e~ /n i// n ,W)e -sf(w 



- N ^¥(dN\u). 



(18) 



n(p) 

U&-Jj,np- 1 p(dJ j , n \T ij:n) ) 
3=1 



n(p) 

i[Ao(T?,Xt) 

i=i 



where E[e N ^ n ">\v\ = e ^(M and now F(dN\vf n , W) corresponds to the law, 
for fixed W, of a random measure N n + J2j=i $Jj „,T (j . n) ,x* > where N n is a 
Poisson random measure with mean described in (8) and W(dN\i>f n ,'W) is 
the posterior distribution of JV|J,T,X. The joint distribution of (J,T,X) 
is obtained by integrating out N in (18). Now using the fact that one can 
decompose (J,T,X) as (( Jj <n ), T*, X*, p), it follows that an expression for 
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the marginal distribution of (T,X), or equivalently (T*,X*,p), is obtained 
by integrating out N and the (Jjn)- For clarity, this takes the form 



^(dT,dX) = e 







n(p) 



II K m ^ rj _M T {j:n)) 



3=1 J 1=1 



n(p) 



The description of the posterior distribution of iV|T,X is given in terms 
of the distribution of iV|J,T,X mixed over the distribution of the i^Jj^n) 
given (T,X). The distribution of (Jj, n ) follows by an appeal to the classi- 
cal Bayes rule; that is, one integrates out N in (18) and then divides the 
remaining quantity by ^ . This yields the results in Proposition 4.1. Now 
it follows that the description of j$ given in Lemma 5.2 is completed by 
verifying Lemma 5.1. This is obtained by using repeatedly the exponential 
change of measure described in Proposition 2.1 of [26]. This is the same 
as working with (17) after removing all the terms that involve W; that is, 
the disintegration e- N( -^P(dN\u) = F(dN\u f )E[e~ N( -^\u}. We apply this re- 
peatedly to the measure [JJ^i e~ N(mfT U :n )-' ) ]P(dN\v). To see this, first set 
gj := Tn,jfTy. n x for j = 1, . . . , n(p) and let each gj now play the role of an 
/. We demonstrate the first two steps. Notice that the first term is obtained 
as 

e- N{9l) F{dN\v) = F{dN\v gi )e- J° (1]n) VwoWM^) 
= F(dN\v gi )E[e~ N ^\i'}. 
The next term is obtained as 

e - N{92) F(dN\u gi ) = F(dN\v gi+g2 )e- ^ *' ^.n MM*) _ 



The last expression follows from the fact that for s < Tf 2:n \, e 92(s,u,x) 
! 1 — u) m2 and e~ 9l<yS,u,x ^ = (1 — u) mi with n = mi. The next term would 
then exploit this type of relationship for g\ , g 2 and g% on s < T( 3 . n \ , where 
mi -\-Tri2 =t<i. It is clear that continuing in this way leads to the conclusion 
of Lemma 5.1. 



Remark 1 1 . More details, including an analysis of semiparametric mod- 
els subject to censoring mechanisms, is given in an older version of this 
manuscript [25]. 
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